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Abstract 

In two player bi-matrix games with partial monitoring, actions played are not 
observed, only some messages are received. Those games satisfy a crucial property 
of usual bi-matrix games: there are only a finite number of required (mixed) best 
replies. This is very helpful while investigating sets of Nash equilibria: for instance, 
in some cases, it allows to relate it to the set of equilibria of some auxiliary game 
with full monitoring. 

In the general case, the Lemke-Howson algorithm is extended and, under some 
genericity assumption, its output are Nash equilibria of the original game. As a by 
product, we obtain an oddness property on their number. 

Introduction 

In finite games, proving the existence of Nash equilibria |1 0^ 1 1 1| is not very challenging, as 
they are fixed points of some correspondence. On the other hand, computing the whole 
set of Nash equilibria (or exhibiting some of its topological properties) is quite hard |12] . 
Similar statements can be made in games where actions chosen or actual payoff mappings 
are (partially) unknown. These games are getting increasing interest and have been 
referred as robust [TJ, ambiguous [2], with uncertainty [6], partially specified [7], and 
so on. Indeed, Nash equilibria are denned similarly as fixed points of some complicated 
- yet regular - correspondence; existence is then ensured, almost always using the very 
same argument of Nash |1U| . Kakutani's fixed point theorem. So the focus shall not be 
existence, but characterizations and computation of these equilibria. 

In full generality and as expected as it is a more complex set-up, this turns out to be a 
very challenging problem [TJ Section 5] . We therefore consider here the class of bi-matrix 
games with partial monitoring, see e.g., [9], which contains all two-player finite games. 
In this framework, players might not observe perfectly their opponent's actions (yet we 
always assume that one knows his own choice); they only receive messages. Depending 
on the game, actions and messages can in fact be correlated as well as independent; we 
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could even assume that the latter is random, but up to some lifting, this can be reduced 
to the deterministic case, see [14] . These games are therefore described by two pair of 
matrices: a first pair for payoffs and a second pair for messages received. 

Players, facing uncertainties upon their payoffs, cannot directly maximize them. As 
it is usual now [5l E], we assume that they optimize their behavior with respect to the 
worst possible scenario, leading to maxmin expected utility. 

Using topological properties of linear mappings and projection, we recover surpris- 
ingly the following fundamental property of finite bi-matrix games with full monitoring 
(when actions are observed). There exists a fixed finite subset of (mixed) actions con- 
taining best-replies to any action of the opponent. While obvious with full monitoring by 
considering whole set of pure actions, this result is not immediate with partial monitoring 
(and actually incorrect in another class of games than the one considered here). 

In the subclass of games called with semi-standard information structure, developed 
in Section [21 this allows the construction of an auxiliary game with full monitoring such 
that its Nash equilibria are (in some sense) also equilibria of the original game. So any 
property with full monitoring holds for this type of games. 

In the general case, this direct reduction is incorrect. Yet we prove in Section 3] 
that Nash equilibria satisfy again another usual properties of full monitoring, see [18] , 
Using this, sets of Nash equilibria are characterized and some of them can be computed 
using the Lemke-Howson algorithm [8], recalled briefly in Section [3j These computations 
are illustrated in Section [5j other claims are also, as often as possible, accompanied by 
examples. Interestingly, since Nash equilibria - even with partial monitoring - are end- 
points of a special instance of the Lemke-Howson algorithm, some oddness property of 
their set is preserved (as soon as some genericity assumption is satisfied). 



1 Two players game with partial monitoring 

Consider a finite two players game T where action of player 1 (resp. player 2) is by A 
(resp. B) and his payoff mapping is u : A x B — > H (resp. v : A x B — > II) , extended 
multi-linearly to X x y . We denote by X = A(A) and y = A(£>) mixed action sets of 
both players. We also assume that they have partial monitoring: they do not observe 
actions of their opponent but receive messages instead, see [9|. Formally, there exist two 
convex compact sets of messages T~L and A4 and two signaling mappings H and M from 
Ax B into T~i or Ai (also extended multi-linearly) such that if players choose x G X and 
y G y, player 1 gets a payoff of u(x,y) but he only observes the message H(x,y) G H. 
On his side, player 2 gets a payoff of v(x,y) and he observes M(x,y) G A4. 

No matter his choice of actions, player 1 cannot distinguish between y and y' G 3^ 
satisfying H (a, y) = H (a, y') for every a G A. We thus define the maximal informative 
mapping H : y — > 7i A (A stands for the cardinality of .4.) by: 



Vyey, H(y)= H(a,y) 



G H A . 
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Similarly, the maximal informative payoff of player 2, M : X — > JV[ B , is defined by 



Vx € X, M(x) = M(x,b) 



£ M 



B 



These linear mappings induce uncertainty correspondences & : y ^ H A and \& : X ~R B 
defined by: 

$ ( y ) = {u(;y') £ R A ; H (y) = H (y)} and * (x) = {u(x', •) € R B ; M (x ) = M (x)} . 

Informally, if player 2 chooses y £ y, then player 1 cannot distinguish it from any other 
y' that have the same image under H; thus, if he plays x £ X, he cannot compute his 
actual payoff as he only infer that it will be on the form {x, U) for some unknown U that 
must belong to &(y) ( which is also equal to <&(?/)). 

When dealing with uncertainties, best replies are extended, following OH], into 

BRi : V(WL A ) ={ X with BR X (U) = argmax inf (x, U) , 

where V(R, A ) is the family of subsets of RA This is well-defined since x \— > inf[/ 6 ^(x, U) 
is concave and upper semi-continuous hence maxima are attained. BR2 : V(R, B ) =4 y is 
defined in a similar way. Definition [1] below of Nash equilibria with partial monitoring 
(see also |14| for more details and explanations) follows naturally. 

Definition 1 (x*,y*) £ X x y is a Nash equilibrium of a game with partial monitoring 
iffx* £ BRx (<%*)) and y* € BR* (*(**)), i.e., iff 

x* £ argmax inf (x,U) and y* £ argmax inf (x,V) . 
xeX U£$(y*) y&y V&<K(x*) 

2 A warm-up: semi-standard structure 

We first consider an easy case: games with a semi-standard information structure. Infor- 
mally, it implies that action sets are partitioned into subsets of undistinguishable actions 
(but it is always possible to distinguish between these subsets). 

Definition 2 The information of player 1 (and similarly for player 2) is semi-standard 
if there exists a partition {Bi ; i £ 1} of B such that 

i) If b and b' belong to the same cell Bi then H(6) = H(fo') = H; and 

ii) The family {H,;; i £ X} is linearly independent, i.e. if Yli^x^^-i = SieX^^* 
then Xi and 73 must be equal, for every i £ I. 

A game has a semi-standard structure if both H and M satisfy these properties. 

In particular, this means that, for every y £ y, given H(y) £ Tl A , player 1 can only 
infer {yt]i £ 1} where j/j = ^beZ3 Vl^l 1S ^ ne probability (accordingly to y) of choosing 
an action in Bi. 



3 



Example 1 IfH = [0, l] d and, no matter b G B, H(a,b) = H(a',b) = where is a 
vector with only one non-zero coordinate which is 1, then player 1 has a semi- standard 
information structure. However, if we do not assume that H(a,b) = H{a',b), then this 
is no longer true. 

Indeed, let A = {a, a'}, B = \b\, b 2 , 63, b^}, T~L = [0, l] 2 and H be represented as 





hi 


b 2 


b 3 


64 


H: a 


ei 


e 2 


ei 




a' 


ei 


e 2 


e 2 


ei 



with e\ = (1, 0) and e 2 = (0, 1). 



The decomposition of point i) of Definition^ must be Hi = (ei,ei), H2 
so on. However, point ii) of the same definition is not satisfied since 



(e 2 ,e 2 ) and 



H 



h + b 2 



ei + e 2 e\ + e 2 



H 



b 3 + 64 



In this framework, following Lemma [T] allows an easy reduction from partial to full 
monitoring. But we need to recall first the general concept of polytopial complex (a 
polytope is the convex hull of a finite number of points^) on which our results rely: 

Definition 3 A finite set {Pk~, k G /C} is a polytopial complex of a polytope P C M, d 
with non-empty interior if: 

i) For every k G /C, C P is a polytope with non empty interior; 

ii) The union Ufce/C ^ k * s e Q ua l t° P> 

Hi) Every intersection of two differents polytopes P^ n Py has an empty interior. 
The following Lemma [1] is an adaptation of an argument stated in |13| Theorem 34] . 



Lemma 1 There exists a finite subset {xf, I G £} of X that contains, for every y £ y, a 
maximizer of the program max x . g ^ min[/ g $( y ) {x, U) and such that its convex hull contains 
the whole set of maximizers. Moreover, there exists a polytopial complex i G £} of 
y such that, for every I G C, xe is a maximizer on y^. 

Similarly, we denote by {y^; k G /C} the set defined in a dual way for player 2. 

Proof: Define, for every i G I, the set of compatible outcomes with Hj by: 

Ui = {u(-,y) G R A ; y s.t. H(y) = H 4 } = co {<,&); b G B{\ , 

where co stands for the convex hull; in particular, Ui = for all b G Bi and it is a 

polytope. So the mapping $ is linear on 3^ sinced it is defined, for every y G y, by 

$(y) =£>% = ££ y[b]Ui =$>[&]*(&). 

1 A polytope can also be defined, in a totally equivalent way, as a compact and non-empty intersection 
of a finite number of half-planes 

2 Actually, the semi-standard structure could also be defined through the linearity of 
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Given x* € X and y G y, if IP € ^(y) is a minimizer of min^/g^) (x*, f7) then it can be 
assumed that U$ is a vertex of because a linear program is always minimized on a 

vertex of the admissible polytope. And necessarily — x* must belong to the normal cone 
to $(y) at pU Theorem 27.4, page 270]. As a consequence, — x* must belong to the 
intersection of —X and a normal cone; more precisely, since (x, IP) is linear, — x* must 
be one of the vertices (or a convex combination of them) of this intersection. 

However, <£(•) is linear on y, so normal cones at vertices - their set is called normal 
fan - are constant, see \22\ Example 7.3, page 193] and [4j page 530]. As a consequence, 
there exists a finite number of intersection between —X and normal cones and they 
all have a finite number of vertices. The set of every possible vertices is denoted by 
— {xf, I € £} and it always contains a maximizer (and any maximizer must belong to 
its convex hull). 

Since is linear, y i— > mmjj^( y \(x£,U) is also linear, for every £ € £; so X£ is a 
maximizer on a polytopial subset of y. □ 

Remark 1 Lemma Q] might be surprising to reader familiar with linear programming. 
Indeed, it is quite clear that ifu\(-,y) is linear then it is always maximized at one of the 
vertices of X . However, in our case, mmu e $( y \{-,U) is not linear but only concave. So 
it can be maximized anywhere in X , even in its interior. 

So without some regularity of the result would obviously e wrong. The key point 
of the proof is that, in our framework, $ is itself induced by the minimization of another 
linear mapping. Lemma{l\holds because min^/gg,^)^, U) is not just any concave mapping, 
but it has this extra specific property. 

We now introduce an auxiliary game T, with full monitoring, such that its Nash 
equilibria somehow coincide with Nash equilibria of V, the original game. Respective 
action sets of player 1 and 2 are £ and fC and payoff mappings 

u(£,k)= min (x£,U) and v(£, k) = min (yk,V). 
Ue0(y k ) Ve9(xe) 

Any pair of mixed actions (x, y) € A(£) x A(/C) induces a pair (x,y) € X x y defined 
by x = E x [x^] € X. This means that, for every a G A, the weight put by x on a is 
x i a ] '■= X^g^xM^M; similarly, y is defined by y = E y [y k ] € y. 

Theorem 2 Every Nash equilibrium of T induces a Nash equilibrium of T and, recipro- 
cally, every Nash equilibrium ofT is induced by a Nash equilibrium ofT. 
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Proof: Let (x, y) be a Nash equilibrium of T and (x, y) the induced mixed actions. By 
linearity of <3?, one has XlfceK: yM^Kl/fc) = ®{y) thus 

«(x, y ) = x M E y fc ) = E x M E y w TT m i? ^ 

— — — — U k £®(Vk) 

= > x[f] min (a;/, U) = > x[fl min (xp,U) 

< min ( > xMa^,£7 > = min (x.U). 

Therefore with, respectively, the fact that (x, y) is a Nash equilibrium, the linearity of $1 
and Lemma [H this implies that 

min (x, U) > S(x, y) > maxu(^, y) = max min (xe, U) = max min (x'U). 

Hence we have proved that x G BRi($(y)); similarly y G BR2( v I / (x)), so (x, y) is a Nash 
equilibrium of I\ 

Reciprocally, let (x,y) be a Nash equilibrium of T. Lemma [1] implies that x is a 
convex combinations of mixed actions in {xf,l G L} that maximize min^/g^) {xp, U). 
Denote by x G A(£) this convex combination and define y in a dual way. 

Since y G A(/C) induces y, then one has, for every £' G C: 

u(£',~y) < max min (x',U) = > x[^] min {x£,U} = u(x, y) , 

where we used respectively the linearity of <I>, the fact that x^ > if xp is a maximizer 
and again the linearity of <£>. Therefore x is a best reply to y and the converse is true 
by symmetry: (x, y) is a Nash equilibrium of T. □ 

Theorem [2] implies that one just has to compute the set of Nash Equilibria of T 
in order to describe the set of Nash equilibria of T. For example, one might consider 
the Lemke-Howson algorithm [8] - or LH-algorithm for short - recalled briefly in the 
following section. 

If r satisfies some non-degeneracy assumption, the LH-algorithm outputs a subset of 
Nash equilibria of both T and T. The specific assumption and how to modify and apply 
this algorithm to any game are detailed in |19] , 



3 Quick reminder on Lemke-Howson algorithm 

The Lemke-Howson algorithm of [8] is designed to compute Nash equilibria of a two- 
player finite game with full monitoring. It is based on the decomposition of X and y 

into best-replies areas. Recall that y a := |y G 3^ s.t. a G argmax a / g ^ u(a', y)| C y, for 
any a G A, is the a-th best-reply area of player 1. The genericity assumption required 
by the LH-algorithm is the following: 
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Assumption 1 {y a ', a € -4} forms a polytopial complex of y and any y € y belongs 
to at most m y best reply areas y a , where m y is the size of the support of y. The similar 
condition holds for {Xb~, b € £>}. 

Stated otherwise, Assumption Q] means that every y E y has at most m y best replies. 

Each y a is a polytope, so denote by V2 and E2 the set of all vertices and edges of 
these sets (necessarily B C V2). For technical purpose, we also assume that V2 contains 
another (abstract) point O2 such that (02,b) belongs to E2 for every b € B. This defines 
a graph Q2 = (V2, E2) over y and similarly a graph Q\ = (Vi, E\) over X . To each vertex 
V2 € V2 (and to each v\ £ B±) is associated the following set of labels: 



i.e., its best replies and pure actions on which it does not put any weight. Label sets of 
abstract points Oi and O2 are L(0i) = A and L(02) = B. 

This induces a product labelled graph Qq = (Vo, Eq) over X x y, whose set of vertices 
is the cartesian product Vq = V\ X V2 and such that there exists an edge in Eq between 
(v\,V2) and (v[, v' 2 ) if and only if v\ = v[ and (v2,v' 2 ) £ E2 or V2 = v' 2 and (v 1, v[) G E\. 
The set of labels of (^1,^2) is L(y\,V2) = L{v\) L)L(v2)- 

Nash equilibria are exactly fully labeled pairs (^1,^2), i-e., if L{v\,V2) = A\ U A2', 
indeed, this means that an action a is either not played (if v\ [a] = 0) or a best reply to 
V2 (if V2 € y a )- The LH-algorithm walks along edges of Qq, from vertices to vertices, and 
stops at a one of those points. We describe quickly in the remaining of this section how 
it works generically (i.e. for almost all games); for more details we refer to |18| [20] and 
references therein. 

Starting at vq = (0i,02) (which is fully labeled), one label t in A U B is chosen 
arbitrarily. The LH algorithm visits sequentially almost fully labeled vertices (vt)tew of 
Go, i.e., points such that L(vt) D AU B\{£} and (vt,Vt+i) is an edge in Eq. Generically, 
at any vt there exists at most one point (apart from Vf-i) satisfying both properties, and 
any end point must be fully labeled. 

As a consequence, when starting from any almost fully labeled point vertex, LH 
algorithm follows either a cycle (and stops when returning to a previously visited point) 
or a path whose endpoints are necessarily Nash equilibria (or (0i,02)). This property 
can be used, for example, to prove that the number of Nash equilibria is generically odd. 

4 Characterization and computation of Nash equilibria 

Without the semi-standard structure, Lemma [1] and Theorem [2] might not hold since 
<3? is not linear (this is illustrated in Example [2]). However, we will show that, in the 
general case, we still have a similar property: is piece-wise linear. This means that $ 
is linear on a polytopial complex of y (see the following Lemma [3]). Using this, it will 
be easy to show (in Lemma [4] below) that best-replies areas forms a polytopial complex, 
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allowing the generalization of LH-algorithm. Such decompositions have been recently 
used in related frameworks, see e.g. |21j . 



Example 2 Assume that A = {T; B}, B = {L,C,R} and % = [0,1]. Payoffs and 
player l's message matrices (player 2 has full monitoring) are given respectively by: 





L 


C 


R 




L 


C 


R 


u: T 


(1,1) 


(0,0) 


(0,0) 


H: T 





1 


1/3 


B 


(0,0) 


(1,2) 


(1,0) 


B 





1 


1/3 



Player 1 cannot distinguish between the mixed action 2/3L + 1/3C and the pure action R. 

Following notations of Lemma [7J one has {x^; £ G £} = {T,B,M} where M = 
1/2T + 1/25 an^ {y^; k G /C} = {L, C, R}. Thus T is defined by the following matrix: 





L 


C 


R 


T 


(1,1) 


(0,0) 


(0,0) 


B 


(0,0) 


(1,2) 


(1/3,0) 


M 


(1/2,1/2) 


(1/2,1) 


(1/2,0) 



This game has three Nash Equilibria: (T, L), (B,C) and (2/3T+ 1/35,1/21, + 1/2C). 
Although the first two are indeed Nash equilibria of T, this is not true for the last one. 
Indeed, $(1/2L + 1/2C) = {(A/2; 1 - A 2 ); A G [0, 1]} and its best response is {T}. 

Actually, and as we shall see in Example T has three Nash equilibria which are 
(T, L), (B,C) and (1/3T + 2/3M, 3/4L + 1/4C) = (2/3T + 1/35, 3/4L + 1/4C) 

Lemma 3 The correspondence is piecewise linear on y. 

Proof: Since H is linear from 3^ into rl , then fi i— > H~ 1 (/i) is piecewise linear on 
H A , see [H page 530] and |15^ Proposition 2.4, page 221]. Therefore, by composition, 

y i—)- H _1 (H(y)^ is piecewise linear on 3^ and y i— > u^-, H" 1 (H(y)^ - which is by 

definition $ - is also piecewise linear on y. □ 

So Lemma [TJ can be rephrased as follows. 

Lemma 4 There exists a finite subset {x^; I G £} of X that contains, for every y G y, a 
maximizer of the program max ie ^ min^/g^) {x, U) and such that its convex hull contains 
the set of maximizers. 

Moreover, for every I G C, X£ is a maximizer on yi which is a finite union of poly topes. 
Similarly, we denote by {y^; k G /C} and {X^; k G /C} the finite sets for player 2. 

Proof: One just has to consider the polytopial complex {Pi; i G 1} with respect to 
which $ and are piecewise linear and apply Lemma [1] on each Pj. □ 

Our main result is the following characterization of Nash equilibria in a general game 
with partial monitoring. We recall that x G A(£) induces the mixed action x = E x [x^] 
where x[a], the weight put by x on a G A, is equal to ^^e£ x M ;E ^[ a ]- 



3 To be extremely rigorous, the pure action 7? should be removed since it is never a best response. 
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Theorem 5 Nash equilibria of Gh o,re induced by points in A(£) x A(/C) that are fully 
labelled with respect to the two decompositions {Y^; i £ £} and {X^; A; £ /C} ('and to 
toe /a&eZ set £\JK) defined by 

Y e = jy G A(/C) s.t. E y [y fe ] 6 = I y € A(/C) s.t. x e G arg max inf (a*,E/) 
and similarly for 

Proof: Consider any fully labelled point (x, y) G A(£) x A(/C) and the induced mixed 
actions x G X and ?/ G y. By definition (see Section [3j), for every I £ £ and k G /C, 
either x[£] = or y belongs to Y^ (and similarly either y[k] = or x G X^). 
As a consequence, x is a best reply to y (and reciprocally) since: 

min (x,U) = min > x[£l(x/,?7) > > x[£| min (xp,U) > max min (x',U). 

Therefore, any fully labelled point induces a Nash equilibrium of V. 

Reciprocally, if (x,y) is a Nash equilibrium of T then Lemma H] implies that x and y 
belong to the convex hull of {xf, £ G £} and {y^; k G JC}. More precisely, x is a convex 
combination of the maximizers of mmu & $^{xi,U) (i.e. those xe such that y G Yg). If 
we denote this convex combination as x = X^g-C*-^] 2 ^' t nen necessarily either x[£] =0 
or y belongs to Y? (and y G Y^). Therefore (x, y) is fully labeled. □ 

It remains to describe why the LH algorithm can be used in this framework. First, 
recall that every set Yi or provided by Lemma 3] is a finite union of polytopes. 
So, up to an arbitrary subdivision of these non-convex unions (associated with maybe a 
duplication of some mixed actions, see Example |3]below), we can assume that {Yf, £ G £■} 
and {X^; k G /C} are finite families of polytopes. 

Lemma 6 Any element of the families {Y;; I G £} and {X&; k G /C} is a polytope. 
Proof: Since, by definition, 



Yf = < y G y s.t. xe G arg max inf (xpi, U) 
{ e'ecue®( y y 

is a polytope of H B , there exists a finite family [bt G Q G H; t G Tp] such that 

Yt=C\ {y^^s.t. (y,6 t ) <c t ) . 
teT e 

Therefore, Yp is also a polytope of H K as it can be written as 

Y t =f] {yGA(/C)s.t.(E y [y]A)<Q}= f| {y e A(JC) s.t. <y, ((y k , b t )) keK ) < c t }. 

Similar arguments hold for {X^; k G /C}. □ 

Using this important property, we can generalize the LH-algorithm to games with 
uncertainties satisfying some non-degeneracy assumptions. 
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Theorem 7 If {Yf, £ £ £} and {X^,; /c € /C} satisfy Assumption^ then any end-point 
of Lemke-Howson algorithm induces a Nash equilibrium ofT. 

Proof: If {Y^; £ € £} and {X^; k E JC} satisfy the non-degeneracy Assumption [H any 
end point of the LH-algorithm is fully labelled, hence a Nash equilibrium of T. □ 

Remark 2 It is not compulsory to use the induced polytopial complexes of A(£) and 
A (/C) . One can work directly in X and y by considering the projection of the skeleton of 
the complexes {Y^; I £ £} and {X&; k 6 /C} onto them. However, the graphs generated 
might not be planar and there are, at first glance, no guarantee that the LH-algorithm will 
work. In the proof of Theorem it is a lifting of the problem that ensures that graphs 
are planar. 

The fact that there was an odd number of Nash equilibria in the game of Example [2] 
(continued in Section [5] below) is therefore not surprising; with full monitoring and the 
non-degeneracy assumption, this can be proved using the LH-algorithm. Therefore, as 
soon as {Y^; £ 6 £.} and {X&; k € /C} satisfy this assumption, there will exist an odd 
number of fully labelled points in A(£) x A(/C) inducing Nash equilibria. 

In some cases, the main argument of the proof of Theorem [5] can be rephrased as 
follows. The game T is, in fact, equivalent to a game V with full monitoring, with action 
spaces C and K, and with payoffs defined in a arbitrary way so that the polytopial com- 
plexes induced by the best-replies areas coincide with {Y^; £ € £} and {X&; k € /C}. 
However, the existence of such abstracts payoffs might not be ensured in general (or 
it can depend on the duplication of the mixed actions chosen, see Example [3]). Any- 
way, whenever it is possible, it is again almost instantaneous to understand that Nash 
equilibria of T and T coincide. 

Example 3 Consider the game defined by, respectively, the following payoffs and signal 
( in M 2 ) matrices for the row player: 





L 


M 


C 


R 




L 


M 


c 


R 


T 


4 


4 


4 





T 


(0,0) 


(0,1) 


(1,0) 


(1,1) 


B 


3 


3 


3 


3 


B 


(0,0) 


(0,1) 


(1,0) 


(1,1) 



Given the signal (a,/3) € [0, l] 2 , the best response is B if a and (3 are both bigger than 
0.25 and the best response is T is either a or (5 is smaller than 0.25. Therefore Yb is 
convex but Yt is not (but it is the union of two polytopes). 

Assume that the column player has a full monitoring and that his four action might 
be best responses, then Y^ is not convex and the decomposition {Y^, Y^} cannot be 
induced by some equivalent game with full monitoring. 

On the other hand, one can find a decomposition of Yt into two polytopes, namely 
Y Tl = H^ 1 ({(a, p) € [0, l] 2 s.t. a < min(0.25, /3)}) and similarly Yp 2 = H^ 1 ({(a, /3) 6 

[0, l] 2 s.t. (3 < min(0.25, a)}^ . It is easy to see that {Y^, Y^ 2 , Yg} can be induced by 
some completely auxiliary game with full monitoring - this decomposition is said to be 
regular, see \2H\ Definition 5.3 and page 132]. And with respect to this decomposition, x € 
A({Ti, T2, B}) induces the mixed action x £ A({T, B}) defined by x[T] = x[Ti] + xp~2]. 
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5 Examples with partial monitoring or in robust games 



Consider again Example [21 The polytopial complexes {Yf, £€£,}:= {Yb,Ym ,Yt} an d 
{X^; k € /C} := {Xc,Xl} are represented in the following figure [TJ 

I 1 1 

C B Q c M Ql T 




Figure 1: On the left X and on the right y and A(/C) with their complexes. 

In order to describe how the LH-algorithm works, we will denote a vertex of the 
product graph by the cartesian product of its labels (in this example the set of labels is 
{T, B, M, L, R,C}); for example the vertex represented with a black dot in figure [1] is 
denoted by {R,T,M} x {B,C,L}. 

The first step in the LH-algorithm is to drop one label arbitrarily; If the label M is 
dropped then the first vertex visited by the algorithm is {L, R, C} x {B, T, C}. The label 
C appears twice, so in order to get rid of one of them, the algorithm chooses at the next 
step the vertex {L, R, B} x {B, T, C} and the following vertex is {L, R, B} x {M, T, C}. 
It is fully labelled, thus an end point of the algorithm, hence (B, C) G X x 3^ is a pure 
Nash equilibrium of T. 

Similarly, If T is dropped at the first stage, then the first vertex is {L,R,C} x 
{B, M, L} and the second {R, C, T} x {B, M, L}. So (T, L) is also a pure Nash equilib- 
rium of r. 

Starting again from this point and dropping the label C makes the LH-algorithm visit 
{R, T, M } x {B, M, L}, and then {R, T,M}x {B, L, C} which is also a Nash equilibrium. 
It corresponds to (x,y) = (1/3T + 2/3M,3/4L + 1 /AC) € A(£) x A(/C) which induces 
(x, y) = (2/3T + 1/3B, 3/AL + 1/4C) which is a (mixed) Nash equilibrium of T. 

One can check the remaining vertices of the product graph to be convinced that there 
does not exist any more equilibria. 

We now quickly treat the case of robust games where players observe their opponents 
actions but their payoff mapping is unknown; the only information is that u belongs to 
some polytope U (and v to some V). Then under those assumptions uncertainties 
correspondence <I> and ^ might not be piece-wise linear. 
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Example 4 Assume that the payoff matrix of player 1 belongs to the convex hull of the 
following two matrices, i.e., U = {Xu\ + (1 — A)u2; A € [0, 1]} with 



Ul 





L 


R 




L 


R 


Zi 


1 





T x 








T 2 








and U2 = Ti 


1 





Bi 





1 


Bi 








B 2 








B 2 





1 



Then for any y € [0, 1] ; 



$(yL+(l-y)R 



2/(1 -A) 

(i-y)A 
\ (l-y)(l-A) / 



A € [0, 1] 



which is not piece-wise linear in y. Indeed &{yL + (1 — y)R) can be seen as the set of 
product probability distributions over {T, B} x {1, 2} with first marginal yT + (1 — y)B. 

As a consequence, Lemma S| might not hold so Lemke-Howson algorithm can not, 
in general, be extended (see e.g., [H Section 5] for alternative technics). On the other 
hand, if both players have only 2 actions, then it is not difficult to see that $ and $ are 
piecewise linear (as they cannot turn as in higher dimensions); so in that specific case, 
our results extend. 



Some open questions 

Important questions remains open. We have shown that under some regularity (or non- 
degeneracy) assumption on the decomposition into best reply areas, Nash equilibria are 
induced by an odd number of points. The characterization of such games (maybe as 
a large semi-algebraic class or such that a game chosen uniformly in some open ball 
satisfy it with probability one) appears to be a real challenging problem here. With 
full monitoring, one just has to check that vectors u(-, b) and v(a, ■) are in some generic 
position. With partial monitoring, one must first control the fact that xg and y& are 
themselves in generic position, then that and also satisfy regularity conditions; 
moreover, genericity can be described with respect to the mappings u and v (as in full 
monitoring) or to H and M, or to simultaneously all of them. Answering this question 
will most probably require a deeper understanding of how normal cones evolve with 
u, v, H and M. 

Other questions concern wether index and stability of these equilibria can be defined 
and studied, see [17] : for instance, we can wonder which equilibria remains in a neigh- 
borhood of a given game. The complexity of computing these equilibria, and wether it 
is in the same class than with full monitoring |12] . must also be addressed. 

Acknowledgments: I am grateful to S. Sorin for his - as always - useful comments 
and to F. Riedel, B. von Stengel and G. Vigeral for their wise remarks. 
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